HIDDEN MARKOV MODELS AND LARGE-SCALE GENOME ANALYSIS

Author

  • Sean R. Eddy
Abstract

PFAM is a database of multiple alignments and hidden Markov models (HMMs) of common, conserved protein domains. PFAM HMMs complement BLAST analysis in the annotation of the C. elegans and human genome sequencing projects at Washington University and the Sanger Centre. PFAM2, based on full, gapped multiple alignments of structural and/or functional protein domains, currently contains 527 models. PFAM/HMM analysis hits at least one domain in 24% of the predicted proteins in the C. elegans genome project. 8% of C. elegans proteins are annotated as multidomain proteins by PFAM, with up to 5 different kinds of recognized domains per protein and up to 44 total recognized domains per protein.

Similar resources

MAN-MACHINE INTERACTION SYSTEM FOR SUBJECT INDEPENDENT SIGN LANGUAGE RECOGNITION USING FUZZY HIDDEN MARKOV MODEL

Sign language recognition (SLR) has attracted growing interest in the human–computer interaction community. The major challenge that SLR faces now is developing methods that will scale well with increasing vocabulary size with a limited set of training data for the signer independent application. The automatic SLR based on hidden Markov models (HMMs) is very sensitive to gesture's shape inf...

Probability Distance Based Compression of Hidden Markov Models

Abstract. Large-scale stochastic models are relevant in many different fields such as computational biology, finance, social sciences, communication and traffic networks. In order to both efficiently simulate and analyze such models and to understand the essential properties of the system, it is desirable to have model reduction techniques that much reduce the dimensionality of the model while ...

Hidden Markov models in biological sequence

The vast increase of data in biology has meant that many aspects of computational science have been drawn into the field. Two areas of crucial importance are large-scale data management and machine learning. The field between computational science and biology is varyingly described as “computational biology” or “bioinformatics.” This paper reviews machine learning techniques based on the use of...

Online Spectral Identification of Dynamical Systems

Recently, a number of researchers have proposed spectral algorithms for learning models of nonlinear dynamical systems—for example, Hidden Markov Models (HMMs) [1, 2], Partially Observable Markov Decision Processes (POMDPs) [3], and Predictive State Representations (PSRs) [4, 3, 5]. These algorithms are attractive since they are statistically consistent and not subject to local optima. However,...

Scaling Factorial Hidden Markov Models: Stochastic Variational Inference without Messages

Factorial Hidden Markov Models (FHMMs) are powerful models for sequential data but they do not scale well with long sequences. We propose a scalable inference and learning algorithm for FHMMs that draws on ideas from the stochastic variational inference, neural network and copula literatures. Unlike existing approaches, the proposed algorithm requires no message passing procedure among latent v...

Publication date: 1997